1.  Introduction.  Let $\{X_n\} = \{X_n;\ n = 0, \pm 1, \ldots\}$ be a q-variate weakly stationary stochastic process (WSSP) with spectral distribution (matrix) $F(\lambda)$, $-\pi < \lambda \le \pi$.

An important problem in the prediction theory of such processes is to find conditions on the process, or equivalently on its spectral distribution F, so that the linear least squares predictor of a future value of the process admits a mean-convergent series representation in terms of the past (observed) values of the process. This problem was solved by Wiener and Masani [18], and Masani, by imposing some boundedness conditions on f, the spectral density (matrix) function of the process.

Recently, using the notion of positivity of the angle between the past-present and the future subspaces of the process, it was shown by Pourahmadi that the series representation of the predictor is possible under some weaker conditions. This was made possible by using the idea of angle due to Helson and Szegő [3] (see also Miamee [10] for a multivariate extension of this). However, these results hold under conditions which require the process to be of full rank. The main purpose of the present study is to consider the same problem, including their autoregressive representation, for the degenerate WSSP's.

Now we explain the results of this paper.


It is well known [7] that every purely nondeterministic WSSP has a one-sided moving average representation as

(1.1)    $X_n = \varepsilon_n + C_1 \varepsilon_{n-1} + \cdots = \sum_{k=0}^{\infty} C_k \varepsilon_{n-k}$,

where $\{\varepsilon_n\}$ is the innovation process of $\{X_n\}$ and $\{C_k\}$ is a sequence of $q \times q$ constant matrices with

    $\sum_{k=0}^{\infty} \operatorname{tr} C_k C_k^* < \infty$.


This moving average representation plays an extremely important role in prediction theory and statistical analysis of $\{X_n\}$. For example, from (1.1) one can obtain the $\nu$-step ahead predictor of $X_{n+\nu}$ (denoted by $\hat X_{n+\nu}$) based on $X_n, X_{n-1}, \ldots$ as

(1.2)    $\hat X_{n+\nu} = \sum_{k=\nu}^{\infty} C_k \varepsilon_{n+\nu-k}$,

(1.3)    $X_{n+\nu} - \hat X_{n+\nu} = \sum_{k=0}^{\nu-1} C_k \varepsilon_{n+\nu-k}$.


Also the moving average representation (1.1) is used in studying the limiting distribution of certain statistics, such as the estimators of the autocovariance and the spectral density matrix, which are useful in the analysis of time series data collected from $\{X_n\}$.
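As a minimal illustration of (1.2) and (1.3), the following sketch splits a truncated scalar moving average into its predictable part and its prediction error. The finite coefficient list, the numerical values, and the function name `predict` are assumptions made for illustration only; the text itself works with infinite series of $q \times q$ matrices.

```python
def predict(coeffs, innovations, nu):
    """Split a truncated scalar moving average into predictor and error.

    coeffs[k] plays the role of C_k in (1.1), with coeffs[0] == 1.
    innovations[j] plays the role of eps_{n+nu-j}, j = 0, 1, ..., so the
    first nu entries are future innovations, unknown at time n.
    """
    terms = [c * e for c, e in zip(coeffs, innovations)]
    prediction = sum(terms[nu:])  # (1.2): terms with k >= nu
    error = sum(terms[:nu])       # (1.3): terms with k = 0, ..., nu - 1
    return prediction, error


# Hypothetical coefficients C_0..C_3 and innovations eps_{n+2}, eps_{n+1}, eps_n, eps_{n-1}.
coeffs = [1.0, 0.5, 0.25, 0.125]
innovations = [2.0, -1.0, 4.0, 8.0]

x_hat, err = predict(coeffs, innovations, nu=2)
x_future = sum(c * e for c, e in zip(coeffs, innovations))
```

Here `x_hat + err` reproduces `x_future`, which is the decomposition of $X_{n+\nu}$ into (1.2) plus (1.3).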

To render the moving average representation (1.1) and the form of the best linear predictor fully satisfactory, we should be able to express the innovation process in terms of the past (observed) values of the process $\{X_n\}$ itself, so that the best linear predictor would also be expressed in terms of these observed values of the process.


1.4  Definition.  We say that the moving average process in (1.1) has an autoregressive representation if there exists a sequence $\{A_k\}$ of $q \times q$ matrices such that

(1.5)    $\varepsilon_n = \sum_{k=0}^{\infty} A_k X_{n-k}$,

where the infinite series is to converge in the mean.

It is useful to note that (1.1) can be viewed as a stochastic difference equation in $\{\varepsilon_n\}$ with $\{X_n\}$ as the input. The existence of the autoregressive representation (1.5) assures that the difference equation (1.1) has a solution, and therefore (1.5) can be viewed as a stochastic difference equation in $\{X_n\}$ with $\{\varepsilon_n\}$ as the input, i.e. the roles of $X_n$ and $\varepsilon_n$ can be reversed. Due to this reversal of roles, the term invertibility of the moving average (1.1) is sometimes used in the literature on time series, instead of the autoregressive representation.
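The reversal of roles can be sketched numerically in the scalar case. Formally, (1.1) and (1.5) say that the power series $\sum_k C_k z^k$ and $\sum_k A_k z^k$ are reciprocal, so the $A_k$ follow from the $C_k$ by a simple recursion. The function name `ar_coefficients` and the choice of a first-order moving average $X_n = \varepsilon_n + c\,\varepsilon_{n-1}$ with $|c| < 1$ are assumptions for illustration; the recursion says nothing about mean convergence, which is the actual subject of this paper.

```python
def ar_coefficients(ma, count):
    """Formally invert the power series sum_k ma[k] z^k, with ma[0] == 1.

    Returns the first `count` coefficients a_k of the reciprocal series,
    i.e. the candidate autoregressive coefficients in the scalar case.
    """
    a = [1.0]
    for k in range(1, count):
        # The coefficient of z^k in the product of the two series must vanish.
        upper = min(k, len(ma) - 1)
        s = sum(ma[j] * a[k - j] for j in range(1, upper + 1))
        a.append(-s)
    return a


c = 0.5                               # hypothetical moving average parameter
a = ar_coefficients([1.0, c], 6)      # candidate A_0, ..., A_5
```

For this first-order example the recursion gives $A_k = (-c)^k$, the familiar geometric inversion of a moving average.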

Note that if $\{X_n\}$ has an autoregressive representation, then it follows from (1.5) that the one-step-ahead predictor of $\{X_n\}$ satisfies the equation

(1.6)    $A_0 \hat X_{n+1} = -\sum_{k=1}^{\infty} A_k X_{n+1-k}$,

which can be solved for $\hat X_{n+1}$, expressing it (uniquely) in terms of the observed values of the process, provided that $A_0$ is invertible. However, as we shall see in section 4, the invertibility of $A_0$ is tied up with the invertibility of the prediction error matrix G. Thus the question of the rank of G, or equivalently the rank of the process $\{X_n\}$, enters the scene. Also, it follows from (1.5), upon formal substitution in (1.2), that the autoregressive representation of $\{X_n\}$ may entail an autoregressive representation of its linear least squares predictor.

1.7  Definition.  Let $\{X_n\}$ be as in (1.1) and $\nu \ge 1$ be a fixed integer. We say that the linear least squares predictor of $\{X_n\}$ has an autoregressive representation if there exists a sequence of constant $q \times q$ matrices $\{E_k\}$ such that

(1.8)    $\hat X_{n+\nu} = \sum_{k=0}^{\infty} E_k X_{n-k}$,

where the infinite series is to converge in the square mean.

In the light of Definitions 1.4 and 1.7 it is natural to ask how the autoregressive representation of $\{X_n\}$ and that of its linear least squares predictor are related. In section 4 we show that these two representations are, indeed, equivalent regardless of the rank of the process $\{X_n\}$, and in fact $A_0$ can be taken to be I. This result shows the importance of the autoregressive representation problem of $\{X_n\}$ in prediction theory. To solve this problem one has to find conditions on the spectral distribution F, so that the infinite series in (1.5) converges in the mean, which is in turn equivalent to the convergence of

    $\sum_{k=0}^{\infty} \sum_{l=0}^{\infty} \operatorname{tr} A_k \Gamma_{l-k} A_l^* < \infty$.
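The double sum above can be evaluated explicitly in a simple scalar case. The sketch below assumes a first-order moving average $X_n = \varepsilon_n + c\,\varepsilon_{n-1}$ driven by unit-variance white noise, with candidate coefficients $A_k = (-c)^k$; the helper names `gamma` and `partial_norm_sq` are hypothetical. For this example the partial sums equal $1 + c^{2(N+1)}$, which stays bounded and tends to the innovation variance 1 when $|c| < 1$.

```python
def gamma(m, c):
    """Autocovariance Gamma_m of X_n = eps_n + c * eps_{n-1}, unit-variance noise."""
    if m == 0:
        return 1.0 + c * c
    if abs(m) == 1:
        return c
    return 0.0


def partial_norm_sq(c, N):
    """Squared norm of sum_{k=0}^{N} a_k X_{n-k} with a_k = (-c)^k, via the double sum."""
    a = [(-c) ** k for k in range(N + 1)]
    return sum(a[k] * gamma(l - k, c) * a[l]
               for k in range(N + 1) for l in range(N + 1))


c = 0.5                                    # hypothetical parameter, |c| < 1
orders = (1, 5, 20)
values = [partial_norm_sq(c, N) for N in orders]
```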


Although our initial results in sections 3 and 4 have been worked out in the time domain, which is useful as far as applications in time series are concerned, to get such spectral criteria for the autoregressive representation problem we have to move to the spectral domain.

After setting up the notation and preliminary results in section 2, we consider in section 3 a problem which is more general than the autoregressive representation, namely, to find conditions on F which enable one to write any Y in the time domain of $\{X_n\}$ as a unique series expansion in terms of the $X_n$'s. The main result here is Theorem 3.8, which provides a characterization of this general property. An important consequence of this characterization and the other results in this section is the fact that if the angle between past and future is positive then the range of the spectral density $f(\lambda)$ is constant. This fact, together with a technique used by Miamee and Salehi [12] (cf. also [11]), reduces our problem regarding a degenerate rank q-variate WSSP to the same problem for a corresponding full rank p-variate (p < q) WSSP.

In section 4 we prove the equivalence of the autoregressive representation of $\{X_n\}$ and that of its predictor, and we show that this happens if and only if the Fourier (Taylor) series of the reciprocal $\Omega^{-1}$ of the factor $\Omega$ of the generating function $\Phi = \Omega \sqrt{G}$ of the process converges to it in the norm of $L^2(f)$ (this is Corollary 4.16). We should say here that it seems there is no better spectral criterion for the autoregressive representation without further restrictions on $\{X_n\}$. This is important in view of Professor Masani's call to find a good necessary and sufficient condition for the validity of the autoregressive representation for $\{X_n\}$ or its predictor.

Of course, the necessary and sufficient condition just mentioned is not very useful because it is not expressed directly in terms of the spectral density. However, using this in conjunction with our other results, we give several concrete and useful sufficient conditions for the validity of the autoregressive representation. By a theorem of Matveev [9], the density of every purely nondeterministic process has constant rank, and our work in section 4 is under the additional requirement that the range of f is constant, which is of course motivated by the results of section 3 mentioned above. It is also shown that analogues of the sufficient conditions due to Masani [6] and Pourahmadi [14,15] hold true in the degenerate rank case as well.

We finally remark that in the presentation of this study a special attempt is made to work in the time domain as far as possible. Such a program is particularly useful for the purpose of applications. Also this, and especially working with the nonnormalized process, is helpful as it postpones the complications arising from the degeneracy of the rank of f, and results in a factorization of $f(\lambda)$ in the form $\Omega(\lambda) G \Omega(\lambda)^*$, which shows that the degeneracies of $f(\lambda)$ stem from a constant matrix, namely the prediction error matrix G. (For more on this see [7, Theorem 13.3].)


2.  Preliminaries.  Let $(\Omega, \mathcal{G}, P)$ be a probability space. $H = L_0^2(\Omega, \mathcal{G}, P)$ denotes the Hilbert space of all complex-valued random variables on $\Omega$ with zero expectation and finite variance. The inner product in H is given by

    $(x, y) = E\, x \bar{y}$,  $x, y \in H$.


Following [7], for $q \ge 1$, $H^q$ denotes the Cartesian product of H with itself q times, i.e. the set of all column vectors $X = (x_1, x_2, \ldots, x_q)^T$ with $x_i \in H$, $i = 1, 2, \ldots, q$. $H^q$ is endowed with a Gramian structure: for X and Y in $H^q$ their Gramian is defined to be the $q \times q$ matrix $(X, Y) = [(x_i, y_j)]_{i,j=1}^{q}$. $H^q$ is a Hilbert space under the inner product $((X, Y)) = \operatorname{trace}(X, Y) = \sum_{i=1}^{q} (x_i, y_i)$ and norm $\|X\| = \sqrt{((X, X))}$, provided the linear combinations are formed with constant $q \times q$ matrices as coefficients.
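A minimal sketch of the Gramian structure, assuming a finite probability space with four equally likely outcomes so that each random variable is just a list of values. The helper names `inner`, `gramian` and `trace`, and the particular zero-mean variables, are hypothetical and serve only to make the definitions concrete.

```python
def inner(x, y, probs):
    """(x, y) = E[x * conj(y)] on a finite probability space."""
    return sum(p * xi * complex(yi).conjugate() for p, xi, yi in zip(probs, x, y))


def gramian(X, Y, probs):
    """Gramian matrix [(x_i, y_j)] of two random vectors given by their components."""
    return [[inner(xi, yj, probs) for yj in Y] for xi in X]


def trace(M):
    return sum(M[i][i] for i in range(len(M)))


probs = [0.25, 0.25, 0.25, 0.25]
x1 = [1, -1, 1, -1]          # zero expectation, unit variance
x2 = [1, 1, -1, -1]          # zero expectation, unit variance, orthogonal to x1
X = [x1, x2]                 # a random vector in H^2

G = gramian(X, X, probs)     # the Gramian (X, X)
norm_sq = trace(G).real      # ((X, X)) = ||X||^2
```

In this toy case the Gramian is the identity matrix and the squared norm is 2.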

For a $q \times q$ matrix $A = (a_{ij})$, $\operatorname{tr} A = \sum_{j=1}^{q} a_{jj}$, $A^* = (\bar{a}_{ji})$, and det A stands for the determinant of A. When A is singular, $A^{\#}$ denotes its Moore-Penrose generalized inverse. Functions are defined on $(-\pi, \pi]$ and we identify this interval with the unit circle in the complex plane in the natural way. Typical values of a function f defined on $(-\pi, \pi]$ or on the unit circle will be denoted by $f(\lambda)$. dm denotes the normalized Lebesgue measure on $(-\pi, \pi]$. For $1 \le p \le \infty$, $L^p$ ($H^p$) denotes the usual Lebesgue (Hardy) space of functions on the unit circle. $L^p_{q \times q}$ ($H^p_{q \times q}$) denotes the space of all $q \times q$ matrix-valued functions whose entries are in $L^p$ ($H^p$).

Let $\{X_n;\ n = 0, \pm 1, \ldots\} \subset H^q$. It is said that $\{X_n\}$ is a q-variate weakly stationary stochastic process (WSSP) if the Gramian matrix $(X_m, X_n)$ depends only on m-n. It can be shown [7] that such a process has a spectral representation of the form

    $X_n = \int_{-\pi}^{\pi} e^{-in\lambda}\, dZ(\lambda)$,

where $Z(\cdot)$ is a countably additive orthogonally scattered $H^q$-valued measure.

The $q \times q$ nonnegative matrix-valued measure $F(\cdot) = (Z(\cdot), Z(\cdot))$ is called the spectral distribution of $\{X_n\}$. In case $F \ll dm$, we say that $\{X_n\}$ has the spectral density $f = F' = dF/dm$. The spectral domain corresponding to the spectral distribution F is denoted by $L^2(dF)$ and is defined by

    $L^2(dF) = \{\Phi;\ \Phi$ is a $q \times q$ matrix-valued function with $\|\Phi\|_F^2 = \int_{-\pi}^{\pi} \operatorname{tr} \Phi(\theta)\, dF(\theta)\, \Phi^*(\theta) < \infty\}$.

It is well known [7] that $L^2(dF)$ with inner product

    $((\Phi, \Psi))_F = \int_{-\pi}^{\pi} \operatorname{tr} \Phi\, dF\, \Psi^*$

is a Hilbert space.

For each subset {...} of $H^q$, sp{...} stands for the closed linear span of elements of {...} in the metric of $H^q$, and the following subspaces associated to our process $\{X_n\}$ are needed:

    $H(X) = \mathrm{sp}\{X_k;\ k = 0, \pm 1, \ldots\}$,

    $P_n(X) = \mathrm{sp}\{X_k;\ k \le n\}$,  $n = 0, \pm 1, \ldots$,

    $F_n(X) = \mathrm{sp}\{X_k;\ k > n\}$,  $n = 0, \pm 1, \ldots$,

    $P_{-\infty}(X) = \bigcap_{n=-\infty}^{+\infty} P_n(X)$,

    $M_n(X) = \mathrm{sp}\{X_k;\ k \ne n\}$,  $n = 0, \pm 1, \ldots$.


The space H(X) is referred to as the time domain of the process $\{X_n\}$. It is well known [7] that the correspondence

    $T: \Phi \to \int_{-\pi}^{\pi} \Phi(\lambda)\, dZ(\lambda)$

is an isometric isomorphism from $L^2(dF)$ onto H(X). T is called the Kolmogorov isomorphism between the spectral and time domain, and plays an important role in finding analytical conditions, in terms of F, for the following important geometrical (regularity) properties of the WSSP $\{X_n\}$.

2.1  Definition:  Let $\{X_n\}$ be a q-variate WSSP.

a) $\{X_n\}$ is said to be purely nondeterministic (regular) if

    $P_{-\infty}(X) = \{0\}$.

b) $\{X_n\}$ is said to be minimal if for some n

    $M_n(X) \ne H(X)$.

c) $\{X_n\}$ is said to be $J_0$-regular if

    $\bigcap_{n=-\infty}^{\infty} M_n(X) = \{0\}$.

d) It is said that the past-present and the future subspaces of $\{X_n\}$ are at positive angle if

    $\rho(X) = \rho(F) < 1$,

where

    $\rho(X) = \rho(F) = \sup\{\,|((Y, Z))| :\ Y \in P_0(X),\ Z \in F_0(X)$ and $\|Y\| = 1,\ \|Z\| = 1\,\}$.


Let $\{X_n\}$ be a purely nondeterministic WSSP. The best linear predictor $\hat X_{n+\nu}$ of $X_{n+\nu}$, $\nu \ge 1$, based on $X_n, X_{n-1}, \ldots$ is given by

    $\hat X_{n+\nu} = (X_{n+\nu} \mid P_n(X))$,

where the latter denotes the orthogonal projection of $X_{n+\nu}$ on the subspace $P_n(X)$ of H(X). For such a process we define a new process $\{\varepsilon_n\}$ by

    $\varepsilon_n = X_n - (X_n \mid P_{n-1}(X))$,  $n = 0, \pm 1, \ldots$,

and it is called the innovation process of $\{X_n\}$. It is known that $\{\varepsilon_n\}$ satisfies $(\varepsilon_m, \varepsilon_n) = \delta_{m,n} G$, and G is called the prediction error matrix of lag 1. A WSSP $\{X_n\}$ is said to be of full rank if its matrix G is invertible (full rank). Otherwise, the process $\{X_n\}$ is of degenerate rank.

It follows from (1.1) that the spectral density $f(\lambda)$ of a purely nondeterministic process admits a factorization of the form

    $f(\lambda) = \Omega(\lambda) G \Omega(\lambda)^* = \Omega(\lambda) G^{1/2} [\Omega(\lambda) G^{1/2}]^*$,

where

    $\Omega(\lambda) = \sum_{k=0}^{\infty} C_k e^{ik\lambda}$

is in $H^2_{q \times q}$. It is shown in [7] that $\Omega(\cdot)$ is an almost everywhere invertible function. Thus, it follows that when $\{X_n\}$ is not of full rank then $f(\cdot)$ is not invertible.
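The last observation can be checked numerically in a small case: since $\det f = |\det \Omega|^2 \det G$, a singular G forces a singular density even where $\Omega$ is invertible. The sketch below assumes a bivariate process with a finite moving average; the coefficient matrices, the singular matrix G, and the helper names are hypothetical choices for illustration.

```python
import cmath


def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def adjoint(A):
    return [[complex(A[j][i]).conjugate() for j in range(2)] for i in range(2)]


def det2(A):
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]


def omega(lam, coeffs):
    """Omega(lambda) = sum_k C_k e^{ik lambda} for a finite list of 2x2 matrices C_k."""
    z = cmath.exp(1j * lam)
    return [[sum(C[i][j] * z ** k for k, C in enumerate(coeffs)) for j in range(2)]
            for i in range(2)]


def density(lam, coeffs, G):
    """f(lambda) = Omega(lambda) G Omega(lambda)^*."""
    Om = omega(lam, coeffs)
    return matmul(matmul(Om, G), adjoint(Om))


coeffs = [[[1.0, 0.0], [0.0, 1.0]],      # C_0 = I
          [[0.5, 0.0], [0.2, 0.3]]]      # hypothetical C_1
G = [[1.0, 1.0], [1.0, 1.0]]             # singular prediction error matrix (rank 1)

points = [0.0, 1.0, 2.5]
det_f = [det2(density(lam, coeffs, G)) for lam in points]
det_omega = [det2(omega(lam, coeffs)) for lam in points]
```

At each sampled frequency `det_omega` is bounded away from zero while `det_f` vanishes up to rounding, so the degeneracy of f comes entirely from G.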
3.  Basicity  and  positivity  of  the  angle.


In this section we define the idea of a generalized Schauder basis for a set of vectors and also give the definition of the angle between past-present and future for a q-variate WSSP, and then we give several criteria for a WSSP $\{X_n\}$ to form a Schauder basis for its time domain H(X). We also obtain several other results which are essential in dealing with the problem of autoregressive representation for degenerate rank multivariate processes in the next section.

Because of the importance of mean-square convergence in different areas of application (particularly time series analysis), it seems that the idea of $\{X_n\}$ forming a generalized Schauder basis is more appropriate than the weaker requirement of $\{X_n\}$ forming a conditional basis as studied by Rozanov [17, pp. 104-108].

To get a feeling as to how the question of uniqueness of the representation of elements of H(X) in terms of a sequence arises and should be settled in the non-full-rank case, we start this section with an example:


3.1  Example.  Let $\{\varepsilon_n\}$ be a univariate white noise process, i.e. $(\varepsilon_m, \varepsilon_n) = \delta_{m,n}$,

and let $\{X_n\}$ be the bivariate process defined by


,  $n = 0, \pm 1, \pm 2, \ldots$.


For a fixed k, consider the element Y =


$\in H(X)$.  Note that for this element we have several different representations in terms of $\{X_n\}$, viz.


and yet


o 

l—i 

- 

1  o' 

1  1' 

1  0 

1  1 

.1  K 

i.e. there is no unique representation for Y in terms of the $X_n$'s.

In view of this simple example and the need for uniqueness of the linear representation in terms of the $X_n$'s in practical problems, it is important to find conditions on $\{X_n\}$ so that every element of H(X) has a linear representation in terms of $\{X_n\}$ which is unique in some sense.

Next we define two kinds of uniqueness for linear representation of elements of H(X). Throughout this section $\{A_n\}$ denotes an arbitrary sequence of $q \times q$ matrices, and it is understood that the infinite series $\sum_{n=-\infty}^{\infty} A_n X_n$ converges in the norm of $H^q$, or in the square mean.


3.2  Definition.  Let $\{X_n;\ n = 0, \pm 1, \pm 2, \ldots\}$ be a q-variate process in $H^q$.

a) $\{X_n\}$ is said to be a Schauder basis for H(X) if every $Y \in H(X)$ has a unique representation

    $Y = \sum_{n=-\infty}^{\infty} A_n X_n$,

in the sense that if Y has another representation

    $Y = \sum_{n=-\infty}^{\infty} A_n' X_n$,

then

    $A_n = A_n'$,  for all n.

b) It is said that $\{X_n\}$ is a generalized Schauder basis for H(X) if every $Y \in H(X)$ can be represented as

    $Y = \sum_{n=-\infty}^{\infty} A_n X_n$,

and furthermore this representation is unique in the sense that if Y has another representation as

    $Y = \sum_{n=-\infty}^{\infty} A_n' X_n$,

then

    $A_n X_n = A_n' X_n$,  for all n.

3.3  Remarks.

a) It should be noted that the notion of Schauder basis for H(X) defined in 3.2(a) is different from that defined in the literature on classical Banach spaces, since here H(X) is a linear space over the ring of $q \times q$ matrices instead of the field of scalars. In this setting it is possible to develop a theory of Schauder bases for H(X) which is similar to, but technically different from, that for classical Banach spaces.

b) For the classical Banach spaces the notions of Schauder and generalized Schauder basis are equivalent. This is not the case for H(X), as Example 3.1 shows. In this example $\{X_n\}$ is a generalized Schauder basis but not a Schauder basis.

Since a generalized Schauder basis for H(X) is not necessarily a Schauder basis, it is of interest to impose conditions on $\{X_n\}$ or F so that the two notions become equivalent. When $\{X_n\}$ is a q-variate WSSP we have:

3.4  Theorem.  Let $\{X_n\}$ be a q-variate WSSP with the spectral distribution F and let $\Gamma_0 = (X_n, X_n) = \int_{-\pi}^{\pi} dF(\lambda)$. Then the following conditions are equivalent:

a) $\{X_n\}$ is a generalized Schauder basis for H(X) and $\Gamma_0$ is invertible.

b) $\{X_n\}$ is a Schauder basis for H(X).

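Proof.  a $\Rightarrow$ b.

Let $Y \in H(X)$ have two representations, viz.

    $Y = \sum_{n=-\infty}^{\infty} A_n X_n = \sum_{n=-\infty}^{\infty} A_n' X_n$.

Since $\{X_n\}$ is a generalized Schauder basis for H(X) we have

    $A_n X_n = A_n' X_n$,  for all n.

Thus

    $A_n \Gamma_0 = A_n (X_n, X_n) = A_n' (X_n, X_n) = A_n' \Gamma_0$,  for all n.

But, since $\Gamma_0$ is invertible we get

    $A_n = A_n'$,  for all n,

i.e. $\{X_n\}$ is a Schauder basis for H(X).

b $\Rightarrow$ a.

Suppose $\{X_n\}$ is a Schauder basis for H(X) and $\Gamma_0$ is not invertible. Then there exists a non-zero vector $\alpha = (\alpha_1, \ldots, \alpha_q)$ such that

    $\sum_{i=1}^{q} \alpha_i X_n^{(i)} = 0$,

where $X_n^{(i)}$ denotes the i-th component of $X_n$. This implies that for $0 = Y \in H(X)$ we have two different representations, namely the one with all coefficients zero and $0 = A X_0$, where A is the non-zero $q \times q$ matrix each of whose rows is $\alpha$, which contradicts the assumption that $\{X_n\}$ is a Schauder basis for H(X).  Q.E.D.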
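The mechanism behind the second half of the proof can be sketched with the identity $\|A X_0\|^2 = \operatorname{tr}(A \Gamma_0 A^*)$: when $\Gamma_0$ is singular, a non-zero coefficient matrix can annihilate $X_0$. The example assumes, purely for illustration, the real bivariate case $X_0 = (\varepsilon_0, \varepsilon_0)^T$ with unit-variance $\varepsilon_0$, so that $\Gamma_0$ is the all-ones matrix; the helper name `quad_trace` is hypothetical.

```python
def quad_trace(A, Gamma):
    """tr(A Gamma A^T), i.e. ||A X_0||^2 when Gamma = (X_0, X_0); real matrices only."""
    n = len(A)
    AG = [[sum(A[i][k] * Gamma[k][j] for k in range(n)) for j in range(n)]
          for i in range(n)]
    # tr(M A^T) equals the entrywise sum of M[i][j] * A[i][j].
    return sum(AG[i][j] * A[i][j] for i in range(n) for j in range(n))


Gamma0 = [[1.0, 1.0], [1.0, 1.0]]     # singular Gramian of the assumed X_0
alpha = [1.0, -1.0]                   # non-zero vector with alpha * Gamma0 = 0
A = [alpha, alpha]                    # non-zero matrix whose rows are alpha
I2 = [[1.0, 0.0], [0.0, 1.0]]

norm_sq_AX = quad_trace(A, Gamma0)    # squared norm of A X_0
norm_sq_X = quad_trace(I2, Gamma0)    # squared norm of X_0 itself
```

Here $A \ne 0$ but $A X_0 = 0$, so the zero element has two distinct coefficient sequences and $\{X_n\}$ cannot be a Schauder basis in the sense of 3.2(a).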

3.5  Remark.  Since $\Gamma_0 \ge G$, it follows that when $\{X_n\}$ is of full rank, then the two notions of bases are equivalent.

The next theorem, which provides a necessary and sufficient condition for a q-variate process $\{X_n\}$ to be a generalized Schauder basis for H(X), is essential in the rest of this work. This theorem is a generalization of a well-known theorem of Nikolskii, cf. [4, p. 105], to the setting of H(X). Although the main steps of its proof are the same as those in the classical setting, the details are different as the $A_n$'s here are $q \times q$ matrices instead of being complex scalars. Since the proof is lengthy we have relegated it to an appendix.


3.6  Theorem.  A q-variate process $\{X_n\}$ is a generalized Schauder basis for H(X) if and only if there exists a positive real number M such that

    $\left\| \sum_{n=k}^{\ell} A_n X_n \right\| \le M \left\| \sum_{n=-\infty}^{\infty} A_n X_n \right\|$

for any $\ell \ge k$ and all $q \times q$ matrices $A_n$, $n = 0, \pm 1, \ldots$, with $\sum_{n=-\infty}^{\infty} A_n X_n \in H^q$.

In general, it is hard to verify the condition of Theorem 3.6. However, when $\{X_n\}$ is a WSSP, then as it turns out this condition is equivalent to the geometrical condition that the past-present and future subspaces of $\{X_n\}$ are at positive angle. This is summarized in the next lemma whose proof, being exactly the same as in the univariate case (cf. [3, pp. 120-130]), is omitted.


3.7  Lemma.  Let $\{X_n\}$ be a q-variate WSSP. Then the following are equivalent:

a) $\rho(X) < 1$.

b) There exists a positive number M such that

    $\left\| \sum_{n=k}^{\ell} A_n X_n \right\| \le M \left\| \sum_{n=-\infty}^{\infty} A_n X_n \right\|$

for all $\ell \ge k$ and all $q \times q$ matrices $A_n$, $n = 0, \pm 1, \pm 2, \ldots$, with $\sum_{n=-\infty}^{\infty} A_n X_n \in H^q$.


Next, by combining Theorem 3.6 and Lemma 3.7 we get the following important result.

3.8  Theorem.  Let $\{X_n\}$ be a q-variate WSSP. Then the following are equivalent:

a) $\rho(X) < 1$.

b) $\{X_n\}$ is a generalized Schauder basis for H(X).

In view of Theorem 3.8 it is important to characterize WSSP's for which $\rho(X) < 1$, i.e. to find a spectral characterization for this useful geometrical property. When $\{X_n\}$ is a q-variate WSSP of full rank such a characterization is given in [10,14,16]. It is important to note, however, that the techniques used in these papers do not work when $\{X_n\}$ is not of full rank.

Here we use a different method which is based on exploiting the characterization of $\rho(X) < 1$ given in Theorem 3.8 and the uniqueness of the representation of elements of H(X) when $\{X_n\}$ is a generalized Schauder basis. As a result of this we show that when $\{X_n\}$ is a generalized Schauder basis for H(X), then $\{X_n\}$ is $J_0$-regular.
n  0 

3.9  Theorem.  Let $\{X_n\}$ be a q-variate WSSP with $\rho(X) < 1$. Then $\{X_n\}$ is $J_0$-regular.

Proof.  Since $\rho(X) < 1$, it follows from Theorem 3.8 that $\{X_n\}$ is a generalized Schauder basis for H(X). To show that $\{X_n\}$ is $J_0$-regular, let

    $Y \in M(X) = \bigcap_{n=-\infty}^{\infty} M_n(X)$.

Thus

    $Y \in M_n(X)$,  for all n.

Now, as in the proof of the "if" part of Theorem 3.6 one can show that $\{X_k;\ k \ne n\}$ is a generalized Schauder basis for $M_n(X)$. Hence, there exists a series representation for Y:

    $Y = \sum_{k \ne n} A_{k,n} X_k$,  for all n.

From this and the uniqueness of such representations as defined in 3.2(b) we get

    $A_{k,n} X_k = 0$,  for all n, k,

which implies that Y = 0.  Q.E.D.

As an immediate consequence of Theorem 3.9 we see that if $\rho(X) < 1$, then the process $\{X_n\}$ is minimal and purely nondeterministic. The following corollary is crucial for our purposes.

3.10  Corollary.  Let $\{X_n\}$ be a q-variate WSSP with the spectral distribution F. If $\rho(X) < 1$, then F has the following properties:

a) F is absolutely continuous with respect to the Lebesgue measure on $(-\pi, \pi]$.

b) R(f) = constant subspace a.e., where f is the density of the process and R(f) denotes the range of f when $f(\cdot)$ is viewed as an operator from $\mathbb{C}^q$ into $\mathbb{C}^q$.

c) $f^{\#} \in L^1_{q \times q}$, where $f^{\#}$ denotes the Moore-Penrose generalized inverse of the matrix f.
Proof.  Since $\rho(X) < 1$, by Theorem 3.9 our WSSP $\{X_n\}$ is $J_0$-regular, and it is shown by Makagon and Weron [5, Theorem 5.3] that $\{X_n\}$ is $J_0$-regular if and only if F satisfies the conditions a, b, and c.

Corollary 3.10 gives some very useful necessary conditions for $\rho(X)$ to be strictly less than one; the most important of these is the condition concerning the constancy of the range of f, as the subsequent argument will reveal. Thus in the following we work under the following natural assumption:

3.11  Assumption.

(i)  $F \ll dm$,

(ii)  R(f) = constant a.e. (dm),

(iii)  $f^{\#} \in L^1_{q \times q}$.


Under this assumption our main problem is reduced to characterizing WSSP's for which $\rho(X) < 1$. Since R(f) is a constant subspace of $\mathbb{C}^q$, we let K denote this subspace and p ($0 < p \le q$) its dimension. It follows from the proof of Theorem 5.1 in [12] that there exists a $q \times q$ constant unitary matrix U such that

(3.12)    $U f(\cdot) U^* = \begin{pmatrix} g(\cdot) & 0 \\ 0 & 0 \end{pmatrix}$,

where $g(\cdot)$ is a $p \times p$ matrix-valued function on $(-\pi, \pi]$. It is easy to check that g is the spectral density of a p-variate purely nondeterministic full rank WSSP, say $\{Y_n\}$. This matrix U and the WSSP $\{Y_n\}$ play important roles in what follows, and their relationship with f and $\{X_n\}$ is prescribed by (3.12) throughout this paper. An important consequence of this relationship is the following.
3.13  Theorem.  Let $\{X_n\}$ be a q-variate (not necessarily full rank) WSSP whose spectral distribution F satisfies (3.11). Then, with notation as above, we have

    $\rho(X) < 1$ if and only if $\rho(Y) < 1$.

Proof.  It follows from (3.12) that

    $U X_n = \begin{pmatrix} Y_n \\ 0 \end{pmatrix}$,

thus for each $Z \in H(Y)$ we have

    $\eta = U^* \begin{pmatrix} Z \\ 0 \end{pmatrix} \in H(X)$.

Assume that $\rho(X) < 1$. Then by Theorem 3.8, $\eta$ has a representation

    $\eta = \sum_{n=-\infty}^{\infty} A_n X_n$.

Therefore,

    $U \eta = \sum_{n=-\infty}^{\infty} U A_n U^* U X_n$,

which implies that, with $C_n = U A_n U^*$, we have

    $\begin{pmatrix} Z \\ 0 \end{pmatrix} = \sum_{n=-\infty}^{\infty} C_n \begin{pmatrix} Y_n \\ 0 \end{pmatrix}$,

and hence

    $Z = \sum_{n=-\infty}^{\infty} D_n Y_n$,

where $D_n$ is the $p \times p$ leading principal submatrix of $C_n$. By using the uniqueness of the representation of $\eta$ one can show that this representation of Z is unique in the sense of Definition 3.2(b). Thus $\{Y_n\}$ is a generalized Schauder basis for H(Y) and therefore by Theorem 3.8 we get that

    $\rho(Y) < 1$.

The proof of the other direction is similar.  Q.E.D.
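The reduction (3.12) underlying this proof can be sketched for a constant, real, rank-one density. The matrix f, the orthogonal matrix U, and the helper names below are assumptions chosen for illustration (for real matrices $U^*$ is the transpose); they are not taken from [12].

```python
import math


def matmul(A, B):
    n, m, p = len(A), len(B), len(B[0])
    return [[sum(A[i][k] * B[k][j] for k in range(m)) for j in range(p)] for i in range(n)]


def transpose(A):
    return [list(row) for row in zip(*A)]


s = 1.0 / math.sqrt(2.0)
U = [[s, s], [s, -s]]                  # real unitary (orthogonal) matrix
f = [[1.0, 1.0], [1.0, 1.0]]           # hypothetical constant density of rank p = 1

reduced = matmul(matmul(U, f), transpose(U))   # U f U^*, expected form diag(g, 0)
g = reduced[0][0]                               # the 1x1 full rank block
unitary_check = matmul(U, transpose(U))         # should be the identity
```

Up to rounding, `reduced` has the block form of (3.12) with g = 2 in the leading $1 \times 1$ block and zeros elsewhere, so the degenerate bivariate problem is replaced by a full rank univariate one.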

Theorem 3.13 reduces the problem of characterizing non-full-rank processes with $\rho(X) < 1$ to that for full rank processes of smaller dimension. In view of this one can invoke the known results in the full rank case and state the appropriate conditions in terms of g, the spectral density matrix of the corresponding full rank process $\{Y_n\}$. We should note that the statement of the next theorem is not correct if one replaces g by f, as Example 3.1 provides a counterexample to this effect. The proof of the next theorem is immediate from Theorem 3.13 and the results of [10].

3.14  Theorem.  Let $\{X_n\}$ be as in Theorem 3.13. Then the following are equivalent:

a) $\rho(X) < 1$.

b) $L^2(g) \subseteq L^1_{p \times p}$ and the Fourier series of any function $\psi$ in $L^2(g)$ converges to $\psi$ in the norm of $L^2(g)$.


Although Theorem 3.14 provides necessary and sufficient conditions for $\rho(X) < 1$ in terms of the spectral domain $L^2(g)$ of $\{Y_n\}$, it does not provide any concrete conditions involving the entries of the spectral density. In the following we review some known results which provide more tangible conditions for $\rho(X) < 1$. However, in light of Theorem 3.13, we state all these results for a full rank WSSP.

In the univariate case a complete characterization of WSSP's with $\rho(X) < 1$ is given by Helson and Szegő [3].


3.15  Theorem.  Let $\{X_n\}$ be a univariate WSSP with density f. Then $\rho(X) < 1$ if and only if

(3.16)    $f = e^{u + \tilde{v}}$,

where u and v are bounded real-valued functions with $\|v\|_{\infty} < \pi/2$, and $\tilde{v}$ denotes the harmonic conjugate of the function v.

For q-variate processes one can find such a characterization provided that f has some special properties. For a $q \times q$ matrix-valued function f, denote the smallest and largest eigenvalues of $f(\lambda)$ by $f_1(\lambda)$ and $f_q(\lambda)$, respectively. With this notation we have [14, Theorem 5.3]:


3.17  Theorem.  Let $\{X_n\}$ be a q-variate purely nondeterministic full rank WSSP with a spectral density f satisfying

(3.18)    $\dfrac{f_q}{f_1} \in L^{\infty}$.

Then $\rho(X) < 1$ if and only if $f_q$ satisfies (3.16).


This theorem provides a characterization for $\rho(X) < 1$ in terms of the largest eigenvalue of $f(\cdot)$, which in general might be hard to apply. The next lemma provides a necessary condition for $\rho(X) < 1$ in terms of the diagonal entries of the density matrix f. The proof of this lemma is immediate from the definition of $\rho(X)$ and Kolmogorov's isomorphism.

3.19  Lemma.  Let $\{X_n\}$ be a purely nondeterministic WSSP with the spectral density f. If $\rho(X) < 1$, then for each $j = 1, 2, \ldots, q$, the diagonal element $f_{jj}$ of f satisfies (3.16).


It is certainly of interest to show that the condition of Lemma 3.19 is also sufficient for $\rho(X) < 1$. However, this is not true in general. It turns out that such a condition is necessary and sufficient for $\rho(X) < 1$ when $f$ is nearly diagonal [1]. Let $f = (f_{ij})_{i,j=1}^{q}$, and let $f^{-1}$ denote its inverse, also viewed as a $q \times q$ matrix of functions. With this notation we have the following definition due to Bloom [1].


3.20 Definition. An almost everywhere invertible matrix-valued function $f$ is said to be nearly diagonal, if


It  is  easy  to  see  that  every  diagonal  density  f  is  nearly  diagonal, 
but  the  converse  is  not  true.  The  next  lemma  provides  a  large  class  of  nearly 
diagonal  matrices  which  are  not  necessarily  diagonal. 

3.21 Lemma. If $f$ satisfies (3.18), then $f$ is nearly diagonal.

The following important theorem provides a necessary and sufficient condition for a q-variate WSSP $\{X_n\}$ with a nearly diagonal density to have the property $\rho(X) < 1$.

3.22 Theorem. Let $\{X_n\}$ be a q-variate purely nondeterministic full rank WSSP with a nearly diagonal spectral density matrix $f$. Then $\rho(X) < 1$ if and only if for every $i$, $1 \le i \le q$, $f_{ii}$ satisfies (3.16).

Proof. It is immediate from Propositions 1.2, 4.5 [1].


4. AUTOREGRESSIVE REPRESENTATION OF WSSP's.


In this section we establish the equivalence of the autoregressive representation of a WSSP $\{X_n\}$ and that of its linear least squares predictor based on the past. As a consequence of this we obtain a spectral necessary and sufficient condition for the latter. This spectral characterization is used to provide more concrete and sufficient conditions, in terms of $f$, for the autoregressive representation of $\hat{X}_{n+\nu}$.

From the moving average representation (1.1) of $\{X_n\}$, and the fact that $\{\varepsilon_n\}$ is its nonnormalized innovation process, we have:

(4.1)    $(\varepsilon_m, \varepsilon_n) = \delta_{m,n}\, G$, for integers $m, n$,

(4.2)    $(X_m, \varepsilon_n) = 0$, $m < n$.


For the time being we assume that $\{X_n\}$ has a mean-convergent autoregressive representation as in (1.5) for a sequence $\{A_k\}$. First we attempt to express these $A_k$'s in terms of the $C_k$'s in (1.1). We do this in order to show in a natural way the importance of the rank of $G$ in this determination. However, as we shall see, this time domain procedure does not provide a satisfactory solution to our problem when $\{X_n\}$ is not of full rank. Therefore, when $\{X_n\}$ is of degenerate rank we appeal to a spectral domain argument and resolve the problem of determining the $A_k$'s in its full generality.

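From (1.1) and (4.2), for any $\ell \ge 0$ and any integer $n$, we have

$\delta_{0,\ell}\, G = (\varepsilon_n, \varepsilon_{n-\ell}) = \Big( \sum_{k=0}^{\infty} A_k X_{n-k},\ \varepsilon_{n-\ell} \Big) = \sum_{k=0}^{\ell} A_k\, (X_{n-k}, \varepsilon_{n-\ell})$,

which combined with

$(X_{n-k}, \varepsilon_{n-\ell}) = C_{\ell-k}\, G$, $0 \le k \le \ell$,

gives

(4.3)    $\sum_{k=0}^{\ell} A_k C_{\ell-k}\, G = \delta_{0,\ell}\, G$, for all $\ell \ge 0$,

or equivalently (since $C_0 = I$),

(4.4)    $\sum_{k=0}^{\ell} A_k C_{\ell-k}\, G = 0$ (or $A_\ell G = -\sum_{k=0}^{\ell-1} A_k C_{\ell-k}\, G$), $\ell > 0$.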

This shows the relationship among the matrices $A_k$'s, $C_k$'s and $G$, when $\{X_n\}$ has a mean-convergent autoregressive representation. In particular, it follows that when $G$ is not of full rank, then $A_0, A_1, \ldots$ can not (necessarily) be found uniquely in terms of the $C_k$'s.
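The non-uniqueness can be seen in a small numerical case. The following is a minimal Python sketch, assuming a hypothetical 2x2 example with $A_0 = C_0 = I$; the matrices `G`, `C1` and the perturbation `N` are illustrative choices, not taken from the paper. It checks (4.4) at $\ell = 1$, that is, $(A_0 C_1 + A_1 C_0) G = 0$, for two different candidates $A_1$.

```python
# Illustrative sketch (hypothetical numbers): when G is singular, relation (4.4)
# with l = 1, i.e. (A_0 C_1 + A_1 C_0) G = 0, does not pin down A_1.

def mat_mul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def mat_add(a, b):
    """Entrywise sum of two matrices of the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

I2 = [[1, 0], [0, 1]]
ZERO = [[0, 0], [0, 0]]

G = [[1, 0], [0, 0]]     # assumed rank-one prediction error matrix
C1 = [[2, 3], [4, 5]]    # assumed moving average coefficient (C_0 = I)

def satisfies_44(A1):
    """Check (A_0 C_1 + A_1 C_0) G == 0 with A_0 = C_0 = I."""
    lhs = mat_mul(mat_add(mat_mul(I2, C1), mat_mul(A1, I2)), G)
    return lhs == ZERO

A1_first = [[-2, -3], [-4, -5]]     # A_1 = -C_1
N = [[0, 7], [0, -1]]               # any N with N G = 0
A1_second = mat_add(A1_first, N)    # a different candidate

print(satisfies_44(A1_first), satisfies_44(A1_second), A1_first != A1_second)
```

Both candidates satisfy the relation although they differ, because any matrix $N$ with $NG = 0$ can be added to $A_1$. When $G$ is invertible, the only such $N$ is zero.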

To reveal more fully the role of $G$ in determining the $A_k$'s, we need to introduce some notation. Corresponding to the moving average representation (1.1) of $\{X_n\}$ we consider the matrix-valued function $\Omega$ on $(-\pi, \pi]$ given by

(4.5)    $\Omega(\lambda) = \sum_{k=0}^{\infty} C_k e^{ik\lambda}$.

It is known [7, Theorem 15.5] that for almost every $\lambda$ in $(-\pi, \pi]$ this matrix function $\Omega$ is invertible and that the spectral density matrix $f(\lambda)$ then admits the factorization

(4.6)    $f(\lambda) = \Omega(\lambda)\, G\, \Omega(\lambda)^* = [\Omega(\lambda) G^{1/2}]\,[\Omega(\lambda) G^{1/2}]^*$,

where $\Phi = \Omega G^{1/2}$ is referred to as the generating function of the WSSP $\{X_n\}$.
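As a quick sanity check of the factorization (4.6), here is a minimal Python sketch for the scalar ($q = 1$) moving average of order one, $C_0 = 1$, $C_1 = c$. The values of `c` and `g` are hypothetical, and the normalization follows (4.6) as written (no $1/2\pi$ factor is assumed). It compares $\Omega G \Omega^*$ with the hand-expanded form $g(1 + c^2 + 2c\cos\lambda)$.

```python
import cmath
import math

def omega(lam, c):
    """Omega(lambda) = sum_k C_k e^{ik lambda} for the MA(1) case C_0 = 1, C_1 = c."""
    return 1 + c * cmath.exp(1j * lam)

def density(lam, c, g):
    """f(lambda) = Omega(lambda) G Omega(lambda)^* as in (4.6), scalar case."""
    w = omega(lam, c)
    return (w * g * w.conjugate()).real

def density_closed_form(lam, c, g):
    """|1 + c e^{i lambda}|^2 g, expanded by hand."""
    return g * (1 + c * c + 2 * c * math.cos(lam))

c, g = 0.5, 2.0   # hypothetical MA coefficient and innovation variance
for lam in (-3.0, -1.0, 0.0, 0.7, 3.1):
    print(round(density(lam, c, g), 6), round(density_closed_form(lam, c, g), 6))
```

In the matrix case the same check would use conjugate transposes in place of complex conjugates.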

Let the matrix-valued function $\Psi(\lambda)$ denote the isomorph of $\varepsilon_0$ in $L^2(f)$; then it follows from (1.5) that

(4.7)    $\Psi(\lambda) = \sum_{k=0}^{\infty} A_k e^{ik\lambda}$.

In light of (4.3) or (4.4) it is easy to check that the functions $\Omega$, $\Phi$ and $\Psi$ satisfy the equation

(4.8)    $\Psi \Phi G^{1/2} = \Psi \Omega G = G$.

This shows that when $G$ is not of full rank then one may not be able to express $\Psi$, the isomorph of $\varepsilon_0$, in terms of $\Phi$ or $\Omega$. However, when the process $\{X_n\}$ is of full rank, or when $G$ is invertible, then from (4.4) and (4.8) one obtains explicit formulas expressing the $A_k$'s in terms of the $C_k$'s, and what is more important or even surprising is that from (4.8) we get $\Psi = G^{1/2} \Phi^{-1} = \Omega^{-1}$.

The previous time domain argument, along with the assumption of existence of mean-convergent autoregressive representations of $\{X_n\}$, was used to show the shortcoming of the commonly used time domain approach in handling problems related to degenerate rank WSSP's.

Next, we use a spectral domain argument to show that for any purely nondeterministic WSSP, $\Psi$, the isomorph of $\varepsilon_0$ in $L^2(f)$, is given by $\Psi = \Omega^{-1}$. Let $\Psi$ denote the isomorph of $\varepsilon_0$; then we have from (1.1) and Kolmogorov's isomorphism that

$e^{-in\lambda} I = (e^{-in\lambda} I + C_1 e^{-i(n-1)\lambda} + \cdots)\, \Psi(\lambda) = e^{-in\lambda}\, \Omega(\lambda) \Psi(\lambda)$,

or $\Omega \Psi = I$. But, since $\Omega$ is invertible a.e., it follows that

(4.9)    $\Psi \Omega = I = \Omega \Psi$.
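The identity (4.9) can be illustrated numerically in the simplest setting. The sketch below assumes a scalar moving average of order one with $\Omega(\lambda) = 1 + c e^{i\lambda}$ and $|c| < 1$ (a hypothetical choice), for which the geometric series gives $A_k = (-c)^k$. It measures how far the product of $\Omega$ with a partial sum of $\sum_k A_k e^{ik\lambda}$ is from 1.

```python
import cmath

def omega(lam, c):
    """Omega(lambda) = 1 + c e^{i lambda} (scalar MA(1), assumed |c| < 1)."""
    return 1 + c * cmath.exp(1j * lam)

def psi_partial(lam, c, n_terms):
    """Partial sum of sum_k A_k e^{ik lambda} with A_k = (-c)^k."""
    return sum(((-c) ** k) * cmath.exp(1j * k * lam) for k in range(n_terms))

def defect(lam, c, n_terms):
    """Distance of Omega(lambda) * (partial sum of Psi) from 1."""
    return abs(omega(lam, c) * psi_partial(lam, c, n_terms) - 1)

c = 0.5   # hypothetical coefficient
for n_terms in (5, 10, 20, 40):
    print(n_terms, defect(0.3, c, n_terms))
```

For this example the product telescopes to $1 - (-c)^K e^{iK\lambda}$ after $K$ terms, so the defect equals $|c|^K$ and shrinks geometrically.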

Thus we have

4.10 Lemma. Let $\{X_n\}$ be a purely nondeterministic q-variate WSSP with the spectral factorization (4.6). Then the isomorph of $\varepsilon_n$ in $L^2(f)$ is given by $e^{-in\lambda}\, \Omega^{-1}(\lambda)$.


Another important consequence of (4.9) is the following set of identities which are crucial in the proof of Theorem 4.12:

(4.11)    $A_0 = I$,   $\sum_{k=0}^{\ell} A_k C_{\ell-k} = \sum_{k=0}^{\ell} C_{\ell-k} A_k = 0$,  $\ell > 0$.
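The identities (4.11) can be read as a recursion for the $A_k$'s: since $C_0 = I$, one has $A_\ell = -\sum_{k=0}^{\ell-1} A_k C_{\ell-k}$ for $\ell > 0$. The following Python sketch implements this recursion with exact rational arithmetic. The function names and the 2x2 example matrix `B` are hypothetical, and the sketch assumes the coefficient list starts with $C_0 = I$ and that coefficients beyond the end of the list are zero.

```python
from fractions import Fraction

def zeros(q):
    return [[Fraction(0)] * q for _ in range(q)]

def identity(q):
    return [[Fraction(int(i == j)) for j in range(q)] for i in range(q)]

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def mat_add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_neg(a):
    return [[-x for x in row] for row in a]

def ma_coeff(C, j):
    """C_j, taken to be the zero matrix beyond the supplied list."""
    return C[j] if j < len(C) else zeros(len(C[0]))

def ar_from_ma(C, n_terms):
    """Solve A_0 = I, sum_{k=0}^{l} A_k C_{l-k} = 0 (l > 0) for A_0..A_{n_terms-1}."""
    q = len(C[0])
    A = [identity(q)]
    for l in range(1, n_terms):
        acc = zeros(q)
        for k in range(l):
            acc = mat_add(acc, mat_mul(A[k], ma_coeff(C, l - k)))
        A.append(mat_neg(acc))   # uses C_0 = I
    return A

def right_identity_holds(C, A):
    """Check the other half of (4.11): sum_{k=0}^{l} C_{l-k} A_k = 0 for l > 0."""
    q = len(C[0])
    for l in range(1, len(A)):
        acc = zeros(q)
        for k in range(l + 1):
            acc = mat_add(acc, mat_mul(ma_coeff(C, l - k), A[k]))
        if acc != zeros(q):
            return False
    return True

B = [[Fraction(1, 2), Fraction(1, 10)], [Fraction(0), Fraction(1, 4)]]
C = [identity(2), B]          # hypothetical MA(1): X_n = eps_n + B eps_{n-1}
A = ar_from_ma(C, 5)
print(A[1] == mat_neg(B), A[2] == mat_mul(B, B), right_identity_holds(C, A))
```

For this example the recursion returns $A_k = (-B)^k$, and the coefficients computed from the left-hand identity also satisfy the right-hand one, as (4.11) asserts.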


In the next theorem we establish the equivalence of the mean-convergent autoregressive representations of $\{X_n\}$ and that of its linear least squares predictor $\hat{X}_{n+\nu}$, $\nu \ge 1$. We note that the method of proof of this theorem is similar to that used by Bloomfield [2, Theorem 1]. But his method does not generalize to the case of q-variate degenerate rank WSSP's, because from (4.4) one can not conclude that $\sum_{k=0}^{\ell} C_{\ell-k} A_k = 0$, for $\ell > 0$, a fact which plays a crucial role in the proof of the theorem, cf. (4.15).

4.12 Theorem. Let $\{X_n\}$ be a q-variate purely nondeterministic (not necessarily full rank) WSSP with a one-sided moving average representation as in (1.1). Then the following are equivalent:

a) $\{X_n\}$ has a mean-convergent autoregressive representation.

b) For $\nu \ge 1$, we have

$\hat{X}_{n+\nu} = \sum_{k=0}^{\infty} E_{\nu k} X_{n-k}$,

where $E_{\nu k} = -\sum_{j=1}^{\nu} C_{\nu-j} A_{j+k}$, $k = 0, 1, 2, \ldots$, and the infinite series defining $\hat{X}_{n+\nu}$ is convergent in the square mean.
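To see what the predictor coefficients of part (b), $-\sum_{j=1}^{\nu} C_{\nu-j} A_{j+k}$, look like in a familiar case, here is a minimal Python sketch for a scalar first-order autoregression $X_n = \varphi X_{n-1} + \varepsilon_n$. The function name `predictor_coeffs` and the value of `phi` are hypothetical. In this case $C_k = \varphi^k$, $A_0 = 1$, $A_1 = -\varphi$, and all other $A_k$ vanish.

```python
from fractions import Fraction

def predictor_coeffs(C, A, nu, n_terms):
    """Coefficient of X_{n-k} in the nu-step predictor, scalar case:
    -sum_{j=1}^{nu} C_{nu-j} A_{j+k}, for k = 0..n_terms-1.
    Coefficients beyond the end of a list are treated as zero."""
    def get(seq, i):
        return seq[i] if i < len(seq) else Fraction(0)
    return [-sum(get(C, nu - j) * get(A, j + k) for j in range(1, nu + 1))
            for k in range(n_terms)]

phi = Fraction(1, 2)                  # hypothetical AR(1) parameter
C = [phi ** k for k in range(10)]     # moving average coefficients C_k = phi^k
A = [Fraction(1), -phi]               # eps_n = X_n - phi X_{n-1}

coeffs = predictor_coeffs(C, A, nu=3, n_terms=4)
print(coeffs)
```

Only the coefficient of $X_n$ is nonzero and it equals $\varphi^3 = 1/8$, which agrees with the classical three-step predictor $\varphi^3 X_n$ of a first-order autoregression.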


Proof. (b) $\Rightarrow$ (a) is trivial in view of (1.3). To show that (a) $\Rightarrow$ (b), from (1.3) we get

(4.13)    $X_{n+\nu} - \hat{X}_{n+\nu} = \sum_{k=0}^{\nu-1} C_k\, \varepsilon_{n+\nu-k} = \sum_{j=1}^{\nu} C_{\nu-j}\, \varepsilon_{n+j}$.

Now, from (a) or (1.5) we have

(4.14)    $\sum_{j=1}^{\nu} C_{\nu-j}\, \varepsilon_{n+j} = \sum_{j=1}^{\nu} C_{\nu-j} \sum_{k=0}^{\infty} A_k X_{n+j-k} = \sum_{j=1}^{\nu} C_{\nu-j} \Big\{ \sum_{k=0}^{j-1} A_k X_{n+j-k} + \sum_{k=j}^{\infty} A_k X_{n+j-k} \Big\}$

          $= \sum_{j=1}^{\nu} C_{\nu-j} \sum_{k=0}^{j-1} A_k X_{n+j-k} + \sum_{j=1}^{\nu} C_{\nu-j} \sum_{k=0}^{\infty} A_{j+k} X_{n-k}$

          $= \sum_{k=1}^{\nu} \Big( \sum_{j=k}^{\nu} C_{\nu-j} A_{j-k} \Big) X_{n+k} + \sum_{k=0}^{\infty} \Big( \sum_{j=1}^{\nu} C_{\nu-j} A_{j+k} \Big) X_{n-k}$.

Also, from (4.11) we get

(4.15)    $\sum_{j=k}^{\nu} C_{\nu-j} A_{j-k} = \sum_{s=0}^{\nu-k} C_{\nu-k-s} A_s = C_0 A_0 = I$ if $\nu = k$, and $= 0$ if $\nu > k$.

By combining (4.13), (4.14) and (4.15) we conclude that

$X_{n+\nu} - \hat{X}_{n+\nu} = X_{n+\nu} + \sum_{k=0}^{\infty} \Big( \sum_{j=1}^{\nu} C_{\nu-j} A_{j+k} \Big) X_{n-k}$,

or

$\hat{X}_{n+\nu} = \sum_{k=0}^{\infty} E_{\nu k} X_{n-k}$, with $E_{\nu k} = -\sum_{j=1}^{\nu} C_{\nu-j} A_{j+k}$.

Q.E.D.


It is shown in Lemma 4.10 that the function $\Omega^{-1}(\lambda)$ is the isomorph of $\varepsilon_0$ in $L^2(f)$. Thus, it follows from the isomorphism between the time and spectral domains that $\{X_n\}$ has a mean-convergent autoregressive representation in $H(X)$ if and only if the Fourier series of $\Omega^{-1}$ converges to $\Omega^{-1}$ in the norm of $L^2(f)$. This observation combined with Theorem 4.12 gives the following important characterization of processes $\{X_n\}$ which admit mean-convergent autoregressive representation.

4.16 Corollary. Let $\{X_n\}$ be as in Theorem 4.12. Then the following are equivalent:

a) $\{X_n\}$ has a mean-convergent autoregressive representation.

b) For $\nu \ge 1$, $\hat{X}_{n+\nu}$ has a mean-convergent autoregressive representation.

c) The Fourier series of $\Omega^{-1}$ converges to $\Omega^{-1}$ in the norm of $L^2(f)$.


4.17 Remark. It should be noted that for any density $f$ with factorization as in (4.6), $\Omega^{-1} \in L^2(f)$. However, it is not necessarily true that $\Omega^{-1} \in L^1_{q \times q}$. Thus, one may not be able to define the Fourier coefficients (and series) of the function $\Omega^{-1}$. One possible way of circumventing such difficulties is to work with the Taylor coefficients (and series) of $\Omega^{-1}$. This is possible since $\Omega^{-1}(z)$, $|z| < 1$, is an analytic function and therefore has a Taylor expansion. In this paper, however, we do not pursue this approach. Instead, by using some of the results of Section 3 we impose appropriate restrictions on $f$ so that difficulties of the above type can not occur, as the following theorem shows.


4.18 Theorem. Let $\{X_n\}$ be as in Theorem 4.12 with $\rho(X) < 1$. Then,

a) $\{X_n\}$ has a mean-convergent autoregressive representation.

b) For $\nu \ge 1$, $\hat{X}_{n+\nu}$ has a mean-convergent autoregressive representation.

Proof of this theorem is immediate from Theorem 3.8. Note that, since $\{X_n\}$ forms a generalized Schauder basis for $H(X)$, the autoregressive representation of $\{X_n\}$ or $\hat{X}_{n+\nu}$ is also unique (in the sense of Definition 3.2(b)). An alternative proof of Theorem 4.18 can be obtained via Corollary 4.16 and Theorem 3.14 and observing (from (3.12)) that

(4.19)    $\begin{pmatrix} g & 0 \\ 0 & 0 \end{pmatrix} = U f U^* = U \Omega U^* \; U G U^* \; U \Omega^* U^* = \begin{pmatrix} \Omega_{11} & \Omega_{12} \\ \Omega_{21} & \Omega_{22} \end{pmatrix} \begin{pmatrix} G_1 & 0 \\ 0 & 0 \end{pmatrix} \begin{pmatrix} \Omega_{11} & \Omega_{12} \\ \Omega_{21} & \Omega_{22} \end{pmatrix}^*$,

where $G_1$ is the one-step prediction error matrix of $\{Y_n\}$ introduced in Section 3.

It follows from (4.19) that the Fourier (Taylor) series of $\Omega^{-1}$ converges to $\Omega^{-1}$ in the norm of $L^2(f)$ if and only if the Fourier series of $\Omega_{11}^{-1}$ converges to $\Omega_{11}^{-1}$ in the norm of $L^2(g)$.

Theorem 4.18 provides an analogue of Theorem 4.1 [14] for the degenerate rank case. Next we provide an analogue of a theorem due to Masani [6], when $\{X_n\}$ is of degenerate rank. For the time being we assume that $f$ satisfies the following conditions:

(4.20)    $f \in L^\infty_{q \times q}$,   $f^{\#} \in L^1_{q \times q}$.

Therefore, it follows from (4.6) that

(4.21)    $G^{\#1/2}\, \Omega^{-1} \in L^2_{q \times q}$.

Since $\Phi^{\#} = G^{\#1/2}\, \Omega^{-1} \in L^2_{q \times q}$, from the Riesz-Fischer theorem and the boundedness of $f$, we conclude that the Fourier series of $\Phi^{\#} = G^{\#1/2}\, \Omega^{-1}$ converges to $\Phi^{\#} = G^{\#1/2}\, \Omega^{-1}$ in the norm of $L^2(f)$. However, this does not necessarily imply that the Fourier (Taylor) series of $\Omega^{-1}$ converges to $\Omega^{-1}$ in the norm of $L^2(f)$ (since $G$ is not of full rank). The latter convergence is what we need to establish the convergence of the autoregressive representation of $\{X_n\}$ or $\hat{X}_{n+\nu}$ (cf. Corollary 4.16). Thus we need an additional condition so that (4.21) implies that

(4.22)    $\Omega^{-1} \in L^2_{q \times q}$.

It is easy to check that if $f$ has a constant range then (4.21) implies (4.22). Therefore, we have proved the following


4.23 Theorem. Let $\{X_n\}$ be a purely nondeterministic WSSP with the spectral density matrix $f$. If $f$ has a constant range and satisfies (4.20) then,

a) $\{X_n\}$ has a mean-convergent autoregressive representation.

b) For $\nu \ge 1$, $\hat{X}_{n+\nu}$ has a mean-convergent autoregressive representation.

Let $A$ denote the class of densities for which $\rho(X) < 1$, and $M$ denote the class of densities satisfying the conditions of Theorem 4.23. One can define a larger class, denoted by $A \otimes M$, which contains either of the previous two classes:

$A \otimes M = \{ f : f = f_1^{1/2} f_2 f_1^{1/2},\ f_1 \in A,\ f_2 \in M \}$.

Let $\{X_n\}$ be a WSSP with density $f \in A \otimes M$; then by using the method of proof of Theorem 3.2 in [15], one can show that $\{X_n\}$ or $\hat{X}_{n+\nu}$ has a mean-convergent autoregressive representation. Similarly, one can form even larger classes based on $A \otimes M$, for which the corresponding processes $\{X_n\}$ admit mean-summable autoregressive representations. Here, we do not discuss the details of these ideas, as they have already been studied in [15]. The results of the later part of this section reveal that the problem of autoregressive representation

5. Appendix; Proof of Theorem 3.6.

Throughout this Appendix $A_n$ and $C_n$, $n = 0, \pm 1, \ldots$, stand for $q \times q$ constant matrices.

First, suppose that there exists a positive number $M$ such that

(5.1)    $\Big\| \sum_{n=k}^{\ell} A_n X_n \Big\| \le M \Big\| \sum_{n=-\infty}^{\infty} A_n X_n \Big\|$, for all $k \le \ell$.


Let $Y \in H(X)$; then since $H(X)$ is complete we have

$Y = \lim_{n \to \infty} Y_n$,

with

$Y_n = \sum_{i=k_n}^{m_n} C_i^{(n)} X_i$, $n = 1, 2, \ldots$,

where $k_n$, $m_n$ are integers.


Next, we show that for each fixed $i$, $\{C_i^{(n)} X_i\}_{n=1}^{\infty}$ is a Cauchy sequence in $H(X)$. For this, note that from (5.1) we have

(5.2)    $\| C_i^{(n)} X_i - C_i^{(n')} X_i \| \le M \| Y_n - Y_{n'} \|$.

Since $\{Y_n\}$ is convergent and hence a Cauchy sequence in $H(X)$, it follows from (5.2) that $\{C_i^{(n)} X_i\}$ is a Cauchy sequence in $H(X)$. Thus there exists $Z_i \in H(X)$ such that, for each integer $i$,

$C_i^{(n)} X_i \to Z_i$ in $H(X)$, as $n \to \infty$.

But since $C_i^{(n)} X_i \in \mathrm{sp}\{X_i\}$ and $Z_i \in \mathrm{sp}\{X_i\}$, we have $Z_i = C_i X_i$ for some matrix $C_i$. Therefore,

$C_i X_i = \lim_{n \to \infty} C_i^{(n)} X_i$.

To finish the proof of the first part we need to show that

(5.3)    $\sum_{i=-\infty}^{\infty} C_i X_i \in H(X)$,

and

(5.4)    $Y = \sum_{i=-\infty}^{\infty} C_i X_i$.

For this let $\varepsilon > 0$ be given; then we can choose $k_\varepsilon > 0$ such that

$\| Y_n - Y_{n'} \| < \varepsilon$, whenever $n, n' \ge k_\varepsilon$.

Now, for any integers $m_0 \le m_1$, $m_2 \le m_3$, by the triangle inequality and (5.1) we have

$\Big\| \sum_{i=m_0}^{m_1} (C_i^{(n)} X_i - C_i^{(n')} X_i) + \sum_{i=m_2}^{m_3} (C_i^{(n)} X_i - C_i^{(n')} X_i) \Big\| \le \Big\| \sum_{i=m_0}^{m_1} (C_i^{(n)} X_i - C_i^{(n')} X_i) \Big\| + \Big\| \sum_{i=m_2}^{m_3} (C_i^{(n)} X_i - C_i^{(n')} X_i) \Big\| \le 2M\varepsilon$,

whenever $n, n' \ge k_\varepsilon$. Letting $n' \to \infty$, we get

(5.5)    $\Big\| \sum_{i=m_0}^{m_1} (C_i^{(n)} X_i - C_i X_i) + \sum_{i=m_2}^{m_3} (C_i^{(n)} X_i - C_i X_i) \Big\| \le 2M\varepsilon$,

whenever $n \ge k_\varepsilon$.


By choosing $-m_1$ and $m_2$ large enough so that

$C_i^{(k_\varepsilon + 1)} X_i = 0$, for all $i \notin [m_1, m_2]$,

we get from (5.5) that

$\Big\| \sum_{i=m_0}^{m_1} C_i X_i + \sum_{i=m_2}^{m_3} C_i X_i \Big\| \le 2M\varepsilon$.

Therefore $\big\{ \sum_{i=-n}^{n} C_i X_i \big\}_{n=1}^{\infty}$ is a Cauchy sequence in $H(X)$. Since $H(X)$ is complete, this proves (5.3).


It remains to show (5.4).

To this end, for given $\varepsilon > 0$, it follows from (5.3) that there exists $N_\varepsilon > 0$ such that, whenever $-m_1, m_2 > N_\varepsilon$, then

(5.6)    $\Big\| \sum_{i=m_0}^{m_1} C_i X_i + \sum_{i=m_2}^{m_3} C_i X_i \Big\| < \varepsilon$, for all $m_0 \le m_1$, $m_2 \le m_3$.

Also, for $n \ge k_\varepsilon$ we have

(5.7)    $\Big\| \sum_{i=m_1}^{m_2} C_i^{(n)} X_i - \sum_{i=m_1}^{m_2} C_i X_i \Big\| = \lim_{n' \to \infty} \Big\| \sum_{i=m_1}^{m_2} C_i^{(n)} X_i - \sum_{i=m_1}^{m_2} C_i^{(n')} X_i \Big\| \le M \lim_{n' \to \infty} \| Y_n - Y_{n'} \| \le M\varepsilon$,

where the first equality holds true because $C_i^{(n')} X_i \to C_i X_i$ for each $i$, and the inequalities are the result of (5.1) and the choice of $k_\varepsilon$. Now, for $n \ge k_\varepsilon$ we can choose $-m_1$ and $m_2$ large enough so that in addition to (5.6) we have

$Y_n = \sum_{i=m_1}^{m_2} C_i^{(n)} X_i = \sum_{i=-\infty}^{\infty} C_i^{(n)} X_i$,

where the $C_i^{(n)}$'s not present in the original representation of $Y_n$ should be interpreted as the zero matrix. Then,

$\Big\| Y_n - \sum_{i=-\infty}^{\infty} C_i X_i \Big\| = \Big\| \sum_{i=-\infty}^{\infty} C_i^{(n)} X_i - \sum_{i=-\infty}^{\infty} C_i X_i \Big\|$

$\le \Big\| \sum_{i \notin [m_1, m_2]} C_i^{(n)} X_i \Big\| + \Big\| \sum_{i=m_1}^{m_2} (C_i^{(n)} X_i - C_i X_i) \Big\| + \Big\| \sum_{i \notin [m_1, m_2]} C_i X_i \Big\| \le (M + 1)\varepsilon$,

because the first term on the right hand side of the inequality is zero (recall how $m_1$ and $m_2$ are chosen), the second term is at most $M\varepsilon$ by (5.7), and the third term is less than $\varepsilon$ by the choice of $m_1$ and $m_2$ in (5.6). And this establishes (5.4).


Finally, to show the uniqueness of a representation of $Y$ in the sense of Definition 3.2(b), suppose that

$Y = \sum_{i=-\infty}^{\infty} C_i X_i = \sum_{i=-\infty}^{\infty} C_i' X_i$.

We have from (5.1) that

$\| C_i X_i - C_i' X_i \| \le M \| Y - Y \| = 0$,

which implies $C_i X_i = C_i' X_i$, for all integers $i$.

To prove the other part of the theorem, assume that $\{X_n\}$ is a generalized Schauder basis and define the space $S$ by

$S = \Big\{ \{C_i X_i\}_{i=-\infty}^{\infty} : \sum_{i=-\infty}^{\infty} C_i X_i \in H(X) \Big\}$.


It is clear that $S$ is a linear space and the following defines a norm on $S$:

$\big\| \{C_i X_i\}_{i=-\infty}^{\infty} \big\|_S = \sup_{m_1 \le m_2} \Big\| \sum_{i=m_1}^{m_2} C_i X_i \Big\|_{H(X)}$.


Now, by using the ideas in the first part of the proof of this theorem, one can show that $S$ with the norm defined above is a Banach space. Consider the operator $T : S \to H(X)$ defined by

$T\big( \{C_i X_i\}_{i=-\infty}^{\infty} \big) = \sum_{i=-\infty}^{\infty} C_i X_i$.


By using the two defining properties of a generalized Schauder basis it can be shown that $T$ is a one-to-one and onto operator. Furthermore, $T$ is bounded, because

$\big\| T\big( \{C_i X_i\} \big) \big\|_{H(X)} = \Big\| \sum_{i=-\infty}^{\infty} C_i X_i \Big\|_{H(X)} = \lim_{n \to \infty} \Big\| \sum_{i=-n}^{n} C_i X_i \Big\|_{H(X)} \le \big\| \{C_i X_i\} \big\|_S$.

Thus by the open mapping theorem $T^{-1}$ is bounded. By choosing $M = \| T^{-1} \|$ we get

$\big\| \{C_i X_i\} \big\|_S \le M \Big\| \sum_{i=-\infty}^{\infty} C_i X_i \Big\|$,

or equivalently

$\Big\| \sum_{i=m}^{n} C_i X_i \Big\| \le M \Big\| \sum_{i=-\infty}^{\infty} C_i X_i \Big\|$, for all $m \le n$,

which is the same as (5.1).

Q.E.D.